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Using a web-based categorization approach to generate thematic metadata from texts 
Chien-Chung Huang, Shui-Lung Chuang, Lee-Feng Chien 

September 2004 ACM Transactions on Asian Language Information Processing (TALIP), 

Volume 3 Issue 3 

Full text available: ^ pdf(317.89 KB) Additional Information: full citation, abstract , references , index terms 

Conventional tools for automatic metadata creation mostly extract named entities or text 
segments from texts and annotate them with information about persons, locations, dates, 
and so on. However, this kind of entity type information is often insufficient for machines to 
understand the facts contained in the texts, thus precluding the possibility of implementing 
more advanced, intelligent applications, such as concept-based search. In this work, we try 
to create more refined thematic metadata ... 
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Woojin Paik, Sibel Yilmazel, Eric Brown, Maryjane Poulin, Stephane Dubon, Christophe Amice 
October 2001 Proceedings of the international conference on Knowledge capture 

Full text available: ^pdf(210.42 KB) Additional Information: full citation , abstract , references , index terms 

This paper describes a metadata extraction technique based on natural language processing 
(NLP) which extracts personalized information from email communications between financial 
analysts and their clients. Personalized means connecting users with content in a personally 
meaningful way to create, grow, and retain online relationships. Personalization often results 
in the creation of user profiles that store individuals' preferences regarding goods or services 
offered by various e-commerce merch ... 
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Full text available: ^ pdf(280.05 KB) Additional Information: full citation, abstract , references, index terms 

Text categorization is typically formulated as a concept learning prob lem where each 
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